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ASYMPTOTIC PROPERTIES OF U-PROCESSES UNDER 
LONG-RANGE DEPENDENCE 

By C. Levy-Leduc*'*, H. Boistard § , E. Moulines* , M. S. 
TAggu^ and V. A. ReisenII 

CNRS/LTCI/TelecomParisTech *, Toulouse School of Economics 
Boston University ^ and Universidade Federal do Espirito Santo " 

Let (Xi)i>i be a stationary mean-zero Gaussian process with 
covariances p(k) = ^{XiXk+t) satisfying: p(0) = 1 and p(k) = 
k-°L{k) where D is in (0, 1) and L is slowly varying at infinity. 
Consider the [/-process {U„(r), r € 7} defined as 



Un(r)= 1 Y, 1 



n(n — 1) L — ' 



where 7 is an interval included in R and G is a symmetric function. 
In this paper, we provide central and non-central limit theorems for 
U n - They are used to derive, in the long-range dependence setting, 
new properties of many well-known estimators such as the Hodges- 
Lehmann estimator, which is a well-known robust location estimator, 
the Wilcoxon-signed rank statistic, the sample correlation integral 
and an associated robust scale estimator. These robust estimators 
are shown to have the same asymptotic distribution as the classical 
location and scale estimators. The limiting distributions are expressed 
through multiple Wiener-Ito integrals. 



1. Introduction. Since the seminal work by Hoeffding (1948), [/-statistics 
have been widely studied to investigate the asymptotic properties of many 
statistics such as the sample variance, the Gini's mean difference and the 
Wilcoxon one-sample statistic, see Serfling (1980) for other examples. One 
of the most powerful tools used to derive the asymptotic behavior of U- 
statistics is the Hoeffding's decomposition [Hoeffding (1948)]. In the i.i.d and 
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weak dependent frameworks, it provides a decomposition of a [/-statistic 
into several terms having different orders of magnitudes, and in general 
the one with the leading order determines the asymptotic behavior of the 
[/-statistic, see Serfling (1980), Borovkova, Burton and Dehling (2001) and 
the references therein for further details. A recent review of the properties 
of [/-statistics in various frameworks is presented in Hsing and Wu (2004). 
In the case of processes having a long-range dependent structure, decompo- 
sition ideas are also crucial. However, in the case of Gaussian long-memory 
processes, the classical Hoeffding's decomposition may not provide the com- 
plete asymptotic behavior of [/-statistics because all terms of this decom- 
position may contribute to the limit, see for example Dehling and Taqqu 
(1991). In this case, the asymptotic study of [/-statistics can be achieved by 
using an expansion in Hermite polynomials, see Dehling and Taqqu (1989, 
1991). For a large class of processes including linear and nonlinear processes, 
a new decomposition is discussed in Hsing and Wu (2004). These authors 
use martingale-based techniques to establish the asymptotic properties of 
[/-statistics. 

A very natural extension of [/-statistics (which are random variables) 
is the notion of [/-processes which encompasses a wide class of estimators. 
For example, Borovkova, Burton and Dehling (2001) study the Grassberger- 
Proccacia estimator which can be used to estimate the correlation dimension. 
In Section 5 of their work, the authors investigate the asymptotic proper- 
ties of [/-processes when the underlying observations are functionals of an 
absolutely regular process, that is, short-memory processes. As far as we 
know, the asymptotic properties of [/-processes in the case of long-range 
dependence setting have not been established yet, and this is the heart of 
the research discussed in this paper. More precisely, our contribution con- 
sists first in extending the results of Borovkova, Burton and Dehling (2001) 
in order to address the long-range dependence case, second in extending the 
results obtained in Dehling and Taqqu (1989) to functions of two variables 
and third in extending the results of Hsing and Wu (2004) to [/-processes. 
The authors of the latter paper establish the asymptotic properties of U- 
statistics involving causal but non necessarily Gaussian long-range depen- 
dent processes whereas, in our paper, we establish the asymptotic properties 
of [/-processes involving Gaussian long-range dependent processes. The au- 
thors in Hsing and Wu (2004) use a martingale decomposition and we use a 
Hoeffding decomposition or a decomposition in Hermite polynomials. In the 



U-PROCESSES UNDER LONG-RANGE DEPENDENCE 



3 



proof section, we also present an extension of some results of Soulier (2001). 
Consider the [/-process denned by 



where I is an interval included in R, G is a symmetric function i.e. G(x, y) = 
G(y, x) for all x, y in R, and the process pQ)i>i satisfies the following as- 
sumption: 

(Al) (Xi)i>i is a stationary mean-zero Gaussian process with covariances 
p(k) = E(X 1 Xk+i) satisfying: 

p(0) = 1 and p(k) = k~ D L(k), < D < 1 , 

where L is slowly varying at infinity and is positive for large k. 
Note that, for a fixed r, U n (r) is a [/-statistic based on the kernel h(-,-,r) 



We show in this paper that the asymptotic properties of the [/-process U n (-) 
depends on the value of D and on the Hermite rank m of the class of func- 
tions {h(-, -,r) — U(r),r G /}, defined in Section 2. We obtain the rate of 
convergence of U n (-) and also provide the limiting process when D > 1/2, 
m = 2 and when D < 1/m, m = 1,2. The convergence rate in the for- 
mer case is of order y/n whereas it is of order n mD//2 /L(n) m / 2 in the latter. 
These results are stated in Theorems 1 and 2, respectively. They are applied 
to derive the asymptotic properties of well-known robust location and scale 
estimators such as the Hodges-Lehmann estimator [Hodges and Lehmann 
(1963)] and the Shamos scale estimator proposed by Shamos (1976) and 
analyzed by Bickel and Lehmann (1979). These properties are illustrated in 
Levy-Leduc et al. (2010a) using numerical experiments. Theorems 1 and 2 
allow us to establish novel asymptotic properties on these estimators in the 
long-range dependence context. The most striking result is that these robust 
estimators have the same asymptotic distribution as the classical estimators, 
see Propositions 5 and 8 in Section 4. 

Theorems 1 and 2 have also been used to derive the asymptotic distribu- 
tion of a robust scale estimator proposed by Rousseeuw and Croux (1993) 



(1) 




where 



(2) 



h{x,y,r 



) = MG{x,y)<r} ,Vx,j/ G R and r G I . 
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and a robust autocovariance estimator introduced in Ma and Genton (2000). 
The robustness and efficiency properties of these estimators have also been 
investigated through numerical experiments and real data analysis. For fur- 
ther details on these theoretical and numerical studies, we refer the reader 
to Levy-Leduc et al. (2010b). 

The paper is organized as follows. In Section 2, the main theorems 1 and 2 
are stated. In Section 3, we derive the asymptotic properties of some quantile 
estimators. Section 4 presents new asymptotic results in the context of long- 
range dependence. In this section, central and non-central limit theorems 
are provided for several statistics as an illustration of the theory presented 
in Sections 2 and 3. These statistics are the Hodges-Lehmann estimator 
[Hodges and Lehmann (1963)], the Wilcoxon-signed rank statistic [Wilcoxon 
(1945)], the sample correlation integral [Grassberger and Procaccia (1983)] 
and an associated scale estimator proposed by Shamos (1976) and 
Bickel and Lehmann (1979). Section 5 develops the proofs of the results 
stated in Section 2. A supplemental article Levy-Leduc et al. (2010c) con- 
tains the proofs of some of the lemmas. It contains also Section 6 which 
concerns numerical experiments. 

2. Main results. We start by introducing the terms involved in the 
Hoeffding's decomposition [Hoeffding (1948)]. Recall the definition of U n (-) 
in (1) and let U(-) be defined as 

(3) U(r) = / h(x, y, r)ip(x)ip(y)dxdy , for all r in / , 

Jm 2 

where tp denotes the p.d.f of a standard Gaussian random variable and h is 
given by (2). For all x in R, and r in /, let us define 



The Hoeffding decomposition amounts to expressing, for all r in /, the dif- 
ference 




(5) 



U n {r) - U(r) 



1 



£ [h{X t ,X J: r)-U{r)\ 



n(n — 1) 



as 



(6) 



U n (r)-U(r) = W n (r) + R n (r) , 
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where 

n 

(7) W n (r) = -Y J {hi(X l ,r)-U(r)} , 

i=i 

and 

(8) 

Rn(r)= . 1 it V {M^^.^-^i^rj-Zn^.^ + ^r)} . 
n n — 1 

We now define the Hermite rank of the class of functions {h(-, •, r) — U(r),r 6 
/} which plays a crucial role in understanding the asymptotic behavior of 
the [/-process U n (-). We shall expand the function (x,y) t- > h(x,y,r) in a 
Hermite polynomials basis of L 2 (M 2 ), that is, the L 2 space on M? equipped 
with product standard Gaussian measures. We use Hermite polynomials 
with leading coefficients equal to one which are: Hq(x) = 1, H\(x) = x, 
H 2 (x) = x 2 - 1, H 3 (x) = x 3 - 3x, . . . .We get 

(9) h(x,y,r) = ^rr-H p (x)H q (y) , in L 2 (M 2 ) , 
where 

(10) a m (r) = E[h(X, Y, r)H p (X)H q (Y)} , 

and where (X, Y) is a standard Gaussian vector that is X and Y are inde- 
pendent standard Gaussian random variables. Thus, 



(11) E[h\X,Yr)]= 



p\q\ 

p,q>0 

Note that ao,o( r ) is equal to U(r) for all r, where U(r) is defined in (3). 
The Hermite rank of h(-, -,r) is the smallest positive integer m(r) such that 
there exist p and q satisfying p + q = m{r) and a Pt q(r) ^ 0. Thus, (9) can 
be rewritten as 

(12) h(x, y, r) - U(r) = £ ^^(^(y) , in L% (R 2 ) . 

p,q>0 

p+q>m(r) 

The Hermite rank m of the class of functions {h(-, ■, r) — U(r) , r E J} is the 
smallest index m = p + q > 1 such that a P) g(r) ^ for at least one r in /, 
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that is, m = inf rg / m(r). By integrating with respect to y in (9), we obtain 
the expansion in Hermite polynomials of hi as a function of x: 

(13) h 1 (x,r)-U(r) = ^ i ^^-H p (x), mL*(R), 

p>i P' 

where L£(R) denotes the L? space on R equipped with the standard Gaus- 
sian measure. Let r(r) be the smallest integer greater than or equal to 1 such 
that a Ti o(r) 7^ 0, that is, the Hermite rank of the function h\(-,r) — U(r). The 
Hermite rank of the class of functions {h\(-,r) — U(r) , r G 1} is the smallest 
index r > 1 such that a Tj o(r) ^ for at least one r. Since r(r) > m(r), for 
all r in /, one has 

(14) t >m . 

In the sequel, we shall assume that m is equal to 1 or 2. As shown in Section 
4, this covers most of the situations of practical interest. Theorem 1, given 
below, establishes the central-limit theorem for the [/-process {\/n(U n (r) — 
U(r)),r E /} when 

D > 1 /m and m = 2 . 

Theorem 1. Let I be a compact interval o/R. Suppose that the Hermite 
rank of the class of functions {h(-, -,r) — U(r) ,r G 1} as defined in (12) is 
m = 2 and that Assumption (Al) is satisfied with D > 1/2. Assume that h 
and h±, defined in (2) and (4), satisfy the three following conditions: 

(i) There exists a positive constant C such that for all s, t in I, u, v in 
R, 

(15) E[\h(X + u,Y + v,s)-h(X + u,Y + v,t)\]< C\t - s\ , 

where (X, Y) is a standard Gaussian vector, 
(ii) There exists a positive constant C such that for all k > 1, 

(16) E[\h(X 1 +u,X 1+k + v,t)-h(X 1 ,X 1+k ,t)\]< C(|u| + H), 



(17) E[\h(X 1 ,X 1+k ,8)-h(X 1 ,X 1+k ,t)\]<C\t-s 
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(Hi) There exists a positive constant C such that for all t, s in I, and x, u, 
v in M, 

(18) \hi(x + u,t)- hi(x + v,t)\ < C(\u\ + \v\) , 
and 

(19) \h 1 (x,s)-h 1 (x,t)\<C\t-s\. 
Then the U -process 

{y/n(U n (r) — U(r)),r G /} 

defined in (1) and (3) converges weakly in the space of cadlag functions T>(I) 
equipped with the topology of uniform convergence to the zero mean Gaussian 
process {W(r),r G /} with covariance structure given by 

(20) E[W{s)W{t)] = 4 Cav(h 1 (X 1 ,s),h 1 (Xi,t)) 

+ 4 ^{Cov(/i! {X x (X t+1 ,t)) + Cov(/ii (Xl ,t),/il (X, +1 , s))} . 
£>1 

Proof of Theorem 1. The proof of the theorem follows from the de- 
composition (6) and Lemmas 9 and 10, given in Section 5.1. Lemma 9 states 
that {y/nW n (r), r £ 1} converges weakly in the space of cadlag functions 
T>(I) equipped with the topology of uniform convergence. Lemma 10 states 
that sup rg / y/n\Rn(r)\ = op(l). Its proof uses Lemmas 11, 12 and 13. □ 

Remark 1. The examples of Section 4 satisfy the conditions (15) to 
(19), e.g. through the choice G(x,y) = (x + y)/2. More generally, suppose 
either: 

(i) G is linear. 

(ii) The function G can be written as G{x, y) = g(L(x, y)) where L(x, y) = 
ax + (3y is some linear function of (x, y), a and /3 in R are such that \a\ = \(3\ 
and g is an even function satisfying for some X g > 0: Vx, t < g{x) < s =^ 
Xgt < \x\ < X g s. 

(hi) G > and satisfies the triangle inequality: G{x + x', y + y') < G(x, y) + 
G(x' , y') and there exists some constant C such that for all (x, y), G(x, y) < 
C(\x\ + \y\). 

Then Condition (i) implies Conditions (15) to (19), Condition (ii) implies 
Conditions (15), (17) and (19) and Condition (iii) implies Conditions (16) 
and (18). The proofs are based on techniques similar to the verification of 
(27) in Section 4.1 where G(x,y) = (x + y)/2. 
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Remark 2. The set I in the previous theorem may be equal to [—00, +00] 
which involves the two-point compactification of the real line. Since [—00, +00] 
is compact, all functions in T>([— 00, +00]) are bounded. In fact, that space 
is isomorphic to T>[0, 1]. 

When D < 1/m, W n and R n are not the leading term and the remainder 
term, respectively. Note that, on one hand, for a fixed r, Corollary 2 of 
Dehling and Taqqu (1989) gives R n {r) = Op{n~ D L{n)) for any D in (0, 1). 
On the other hand, if D < 1/r, where r is defined in (14), Theorem 6 of 
Arcones (1994) implies that W n (r) = Op{rT rD l 2 L{n) T I 2 ) and if D is in 
(1/r, 1/m), W n (r) = Op(n" 1 / 2 ) by Theorem 4 of Arcones (1994). Thus, 
if for instance, r = m = 2, W n {r) and R n (r) may be of the same order 
Op(n~ D L(n)). Hence, to study the case D < 1/m, we shall introduce a 
different decomposition of U n {-) based on the expansion of h in the basis 
of Hermite polynomials given by (9). Thus, U n {r) defined in (1) can be 
rewritten as follows 



(21) 

where 

(22) 



n{n - l){U n {r) - U(r)} = W n (r) + R n {r) , 



Wjr 



E E 

p+q<m 



p\q\ 



-H p {Xi)H q {Xj) . 



Introduce also the Beta function 



(23) B(a, P) = r y a -\l + y^dy = , a > 0, (3 > . 

Jo T(a + /3) 

The limit processes which appear in the next theorem are the standard 
fractional Brownian motion (fBm) (Zi y r>(t))o<t<i and the Rosenblatt process 
(^2,£)(*))o<*<i- They are defined through multiple Wiener-Ito integrals and 
given by 



(24) Z 1 . D (t)= I \ I (u-x) 

JR UO 



-(D+l)/2 



du 



dB(x), < D < 1 , 



and 
(25) 

Z2,D(t) 



(u-x)- {D+1)/2 (u-y)^ D+1)/2 du 



dB(x)dB(y), < D < 1/2 , 
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where B is the standard Brownian motion, see Fox and Taqqu (1987). The 
symbol j' means that the domain of integration excludes the diagonal. Note 
that Zi^d and Z 2 ,d are dependent but uncorrelated. The following theorem 
treats the case D < l/m where m = 1 or 2. 

Theorem 2. Let I be a compact interval o/M. Suppose that Assumption 
(Al) holds with D < l/m, where m = 1 or 2 is the Hermite rank of the 
class of functions {h(-,-,r) — U(r) ,r 6 /} as defined in (12). Assume the 
following: 

(i) There exists a positive constant C such that, for all k > 1 and for all 
s, t in I, 

(26) E[\h(X 1 ,X 1+k ,s)-h(X 1 ,X 1+k ,t)\]<C\t-s\ . 

(ii) U is a Lipschitz function. 
(Hi) The function A defined, for all s in I, by 

(27) A(s) = K[h(X,Y,s)(\X\ + \XY\ + \X 2 - 1|)] , 

where X and Y are independent standard Gaussian random variables, 
is also a Lipschitz function. 

Then, 

( n mD/2 L ( n ym/2 _ [/( r )) ;r g /| 

converges weakly in the space of cadlag functions T>(L), equipped with the 
topology of uniform convergence, to 

{2 aifl (r)k(Dy 1 ' 2 Z hD {l)- rG/}, if m = 1 , 

and to 

{k(D)- 1 [a hl (r)Z 1:D (l) 2 +a 2fi (r)Z 2jD (l)} ;r€ /}, if m = 2 , 

where the fractional Brownian motion Zi t n( m ) and the Rosenblatt process 
^2,d(~) are defined in (24) and (25) respectively and where 

(28) fc(D) = B((l-I>)/2 > D) > 

where B is the Beta function defined in (23). 
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The proof of Theorem 2 is given in Section 5.3. 



Remark 3. The processes Z\^d and Z 2) d are self-similar with mean 0. 
They are, however, not normalized. One has 

nzx, D mi,D^)\ = nzi D (i)\\ {t 2 ^ + s ^ -\t- > , 
nz 2 , D (t)z 2 ,D(s)} = nzi D (i)}^ {t 2H2 + s 2H * -\t- S \ 2H *} , 

where H x = 1 - D/2 G (0, 1/2), H 2 = 1 - D G (0, 1/2) and 

(29) nzi D (i)] - 



(so) nzi D m 



(-D + l)(-D + 2) 

Ak(D) 2 
(-2D + l)(-2D + 2) ' 



with k{D) defined by (28). See Remark 5 below for justification. The non- 
Gaussian random variables Z\ D (X) and Z 2 ^(l) are dependent. Their joint 
cumulants are given in (89) in the supplemental article Levy-Leduc et al. 
(2010c). 

Remark 4. The results of Theorem 2 can be extended to the two- 
parameter process {U[ nt ](r) — U(r);r G 1, < t < 1}. One can show that 

f n mD/2 \ 
\U^{ U \nA(r)-U{r));reI^<t<l^ 

converges weakly in V(I x [0,1]), equipped with the topology of uniform 
convergence, to 

{2ai fi (r)k(D)~ 1 / 2 Z 1)D (t); r G /, < t < 1} , if m = 1 , 

and to 

{k(D)- 1 [a 1A (r)Z ltD (t) 2 + a 2fi (r)Z 2>D (t)] ; r G /, < t < 1} , if m = 2 . 
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3. Asymptotic behavior of empirical quantiles. We shall apply 
Theorems 1 and 2 in the preceding section to empirical quantiles. Recall 
that if V : I — > [0, 1] is a non-decreasing cadlag function, where I is an 
interval of M, then its generalized inverse V~ l is defined by V~ l {p) = inf{r € 
I, V(r) > p}. This applies to U n (r) and U(r) since these are non-decreasing 
functions of r. We derive in the following corollaries the asymptotic behavior 
of the empirical quantile U~ l {-) using Theorems 1 and 2. 

Corollary 3. Let p be a fixed real number in (0, 1). Assume that the 
conditions of Theorem 1 are satisfied. Suppose also that there exists some r 
in I such that U{r) = p, that U is differ entiable at r and that U'(r) is non 
null. Then, as n tends to infinity, 

Mu-\p) - U-\p)) A -W{U-\p))/U'{U-\p)) , 

where W is a Gaussian process having a covariance structure given by (20). 

Proof of Corollary 3. By Lemma 21.3 in van der Vaart (1998), the 
functional T:7h> V~ 1 (p) is Hadamard differentiable at V tangentially to 
the set of functions h inX>([0, 1]) with derivative T{,(h) = -hiV^ip^/V'iV' 1 
Applying the functional Delta method (Theorem 20.8 in van der Vaart (1998)) 
thus yields 

V^{U-\p) - U-\p)) = T{j{Vn~(U n - U)} + o P (l) 

(U n -U){U-\p)) 

= w(u-\p)) + ° p(1) - 

The corollary then follows from Theorem 1. □ 

Corollary 4. Let p be a fixed real number in (0, 1). Assume that the 
conditions of Theorem 2 are satisfied. Suppose also that there exists some r 
in I such that U(r) = p, that U is differentiable at r and that U'(r) is non 
null. Then, as n tends to infinity, 

„mD/2 

(u-\ P )-u-\p)) 



L(n) m / 2 
converges in distribution to 
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-k(D)- 1 {a 1 , 1 ([/- 1 (p))Z 1 , D (l) 2 + a 2fi (U- 1 (p))Z 2 , D (l)} /U'(U-\p)) , if m = 2 , 



where Z\^{-) and ^2,_d( - ) are defined in (24) and (25) respectively, k(D) in 
(28) and a p q (-) is defined in (10). 

The proof of Corollary 4 is based on similar arguments as the proof of 
Corollary 3 and is thus omitted. 

4. Applications. We shall use the results established in Sections 2 
and 3 to study the asymptotic properties of several estimators based on 
[/-processes in the long-range dependence setting. 

4.1. Hodges-Lehmann estimator. Consider the problem of estimating the 
location parameter of a long-range dependent Gaussian process. Assume 
that (Yi)i>\ satisfy Yi = 6 + Xi where (Xi)i>i satisfy Assumption (Al). 
To estimate the location parameter 9, Hodges and Lehmann (1963) suggest 
using the median of the average of all pairs of observations. The statistic 
they propose is 



Define the [/-process U n (r), r E R by (1), where G(x,y) = (x + y)/2. The 
Hodges-Lehmann estimator may be then expressed as 



using x<p(x) = —ip(x), where <p denotes the first derivative of (p. The quanti- 
ties in (31) are different from for all r in M since (p is the p.d.f of a standard 
Gaussian random variable. Thus, the Hermite rank m of the class of func- 
tions {iQf .\< T — aofi(r);r E R} is equal to 1. In order to derive the asymp- 
totic properties of 9hl, we now check the conditions of Theorem 2. Let us 



and to 




e HL = e + u- 1 (i/2). 



If A and B are independent standard Gaussian variables, 
(31) 

ai,o(r) = «o,i( r ) = H A1 {A+B<2r}] = ~ v(2r-y)ip(y)dy = -<p( 




2 ■ 
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check Condition (26). Note that for all k > 1, X x +X l+k ~ 7V(0, 2(l+p(fc))), 
thus if t < s, 

E[h(X 1 ,X 1+k ,s) - h(X u X 1+k ,t)] = $ (-^k=) -$ ' 



1 It -s| 



^/l + ' 

where $ is the c.d.f of a standard Gaussian random variable and = 
inf fc p(fc) > -1. Hence (26) holds. Similarly, \U(s) - U(t)\ < \$(y/2s) - 
<I>(\/2£)| < vr" 1 / 2 ^ — s| and hence J7 is a Lipschitz function. Let us now 
check Condition (27). Note that, if s < t 

^{s<x+y<t}(\ x \ + \ x v\ + \ x2 ~ M) l f(x)^(y)dxdy = 

ip(y)dy) \x\<p(x)dx + / ( / \y\<p(y)dy) \x\<p(x)dx 



+ J (J <^{y)dy S j \x 2 - l\(p(x)dx . 

Using that <p{-) and |.|<p(-) are bounded and that the moments of Gaussian 
random variables are all finite, we get (27). The assumptions of Theorem 2 
are thus satisfied with m = 1 and hence we get that 

\n D/2 L(n)- 1/2 (U n (r) - U{r)) ; -oo < r < +00} 

converges weakly in D([— 00, +00]), equipped with the sup-norm, to 

{-y/2k(D)- 1/2 ip(rV2)Z ltD (l); -00 < r < +00} . 

Here, U(r) = J $(2r - x)ip(x)dx, U'(r) = 2/ <p(2r - x)<p(x)dx, 17(0) = 
1/2 /($(a?) + $(-x))ip(x)dx = 1/2, U~ l {l/2) = and U'{U~ l {l/2)) = 
U'(0) = 1/0F- Since, by (31), a^U' 1 (1/2)) = a 1)0 (0) = -(20F)" 1 , 
Corollary 4 implies that 

(32) n D l 2 L(n)- l / 2 (e HL - 0) A k(D)' 1 / 2 Z hD (l) , 

where using (29), k(D)~ 1 / 2 Zi^£>(l) is a zero-mean Gaussian random variable 
with variance 2(—D + 1) _1 (— D + 2) _1 . Let's now compare the asymptotic 
behavior of the Hodges-Lehmann estimator with that of the sample mean. 
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Lemma 5.1 in Taqqu (1975) shows that the sample mean Y n = n 1 Y^7=i ^ 
satisfies the following central limit theorem 

n D / 2 L(n)~ 1/2 (Y n - 9) A k(D)-^ 2 Zi, D (l) . 

We have thus proved 

Proposition 5. In the long-memory framework with < D < \, the 
asymptotic behavior of the Hodges-Lehmann estimator is Gaussian and given 
by (32). It converges to 9 at the same rate as the sample mean with the same 
limiting distribution. There is no loss of efficiency. 

A similar result was proved in Beran (1991) for location M-estimators. 

4.2. Wilcoxon- signed rank statistic. Assume that (l^)j>i satisfy Yi = 
9 + Xi where pQ)j>i satisfy Assumption (Al). The Wilcoxon-signed rank 
statistic first proposed by Wilcoxon (1945) can be used to test the null 
hypothesis (Ho): "0 = 0" against the one-sided alternative (Hi): " 9 > 
", based on the observations Y"i, . . . , Y n . It is defined as 

n 

Tn = ^#il{X;>0} ) 
3=1 

where the Rj's are the ranks of Xi, . . . ,X n . Thus T n is the sum of the 
ranks of the positive observations. Let us study this statistic under the null 
hypothesis. One will reject the null hypothesis if the value of T n is large. 
Following Dewan and Prakasa Rao (2005), T n can be written as 

n i i \ 

(33) T n = 2^ l{x 4 >o} + Yl 1 {x 1 +x ] >o} =■ nU n>1 + U n>2 ■ 

i=l 1<*<3<^ 

The Hermite rank of 1{.>o} — > 0) equals 1, because E[Ai(l{Xi>o} — 
¥(X\ > 0))] > 0. We then deduce from Theorem 6 of Arcones (1994) that 

(34) n D l 2 L(n)- l l 2 (U n ,i - F(Xi > 0)) = O p (l) . 

The asymptotic properties of U n ^2 can be derived from those of U n (0) where 
U n (-) is the L^-process defined in (1) with G(x, y) = x + y. Using the results 
obtained in the study of the Hodges-Lehmann estimator, we obtain that 
«i,o( r ) = «0,i( r ) = —tp^/Vty/V^, which is different from for all r in 
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R since ip is the p.d.f of a standard Gaussian random variable. Thus, the 
Hermite rank of the class of functions {l(3( v )<r — a o,o( r )i r £ R} is equal 
to 1. Using the same arguments as those used in the previous example, 
the assumptions of Theorem 2 are fulfilled with m = 1. Since 20x^(0) = 
-2<^(0)/V2 = -1/0F, we get 

(35) n°/ 2 L(n)- 1 /2 (C/ _ £/ 2 ( )) A , 



7T 



where [/2(C)) = f f l{ x+y> o}(p(x)<p(y)dxdy = 1/2 and fc(-D) is the constant 
given in (28). From (33), (34) and (35), we get 

2n D / 2 

:{T n - n¥(X 1 > 0) - n{n - l)C/ 2 (0)/2) 



n[n- l)L(n) 1 / 2 



n D / 2 a k(n)- 1 / 2 

(U n ,2 - tMO)) + op(l) A 1 ' Z ljD (l) , 



L(n)V2 

which can be rewritten as follows 

(36) n D l*L{n)-W {-r^T n L- - 1/2) A ^^^(1) . 

\n{n — 1) n — 1 / t/7t 

We have thus proved 

Proposition 6. In the long-memory case with < D < 1, the asymp- 
totic behavior of the Wilcoxon- signed rank statistic T n is Gaussian and given 
by (36). 

The Wilcoxon one-sample statistic U n ,2 was also studied by Hsing and Wu 
(2004) pp. 1617-1618 by using a different approach. We obtain the additional 
constant k{D)~ 1 / 2 in the limiting distribution compared to their result. 

4.3. Sample correlation integral. In the past few years, a lot of atten- 
tion has been paid to the estimation of the correlation dimension of a 
strange attractor. In many examples, the correlation dimension a of an 
invariant probability measure \x can be expressed through the correlation 
integral C M (r) = (// X fj,){(x,y) : \x — y\ < r} through C M (r) « Cr a , as 
r tends to 0, where C is a constant. For further details on the correlation 
dimension and its applications, see Borovkova, Burton and Dehling (2001). 
Grassberger and Procaccia (1983) proposed an estimator of the correlation 
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dimension based on the sample correlation integral U n (r), denned in (1) with 
G(x,y) = \x-y\. In this case, ai )0 (r) = a ,i(r) = J R xl{\ x _ y \< r }tp{x)ip(y)&xdy 
= f^x[&(x + r) — &(x — r)]ip(x)dx, where, as before, (p and $ are the 
p.d.f. and the c.d.f. of a standard Gaussian random variable, respectively. 
Using the symmetry of a standard Gaussian random variable, one gets 
a i,o( r ) = ao.i( r ) = 0. Lengthy but straightforward computations lead to 

(37) a 2 ,o(r) = a 0:2 (r) = -«i,i(r) = Lp(r/y/2) , 

where <p denotes the first derivative of ip. It is non-null if r ^ 0. Thus, for 
any compact interval / which does not contain 0, the Hermite rank of the 
class of functions {^G(-,-)<r ~ a o,o( r )i r £ -0 is equal to 2. Let us assume 
that (Xi)i>i satisfy Assumption (Al). In the case where D > 1/2, let us 
check the assumptions of Theorem 1. Conditions (15) and (16) can be easily 
checked and Condition (17) is fulfilled by using similar arguments as those 
used in the example of the Hodges-Lehmann estimator. Conditions (18) and 
(19) are satisfied since 

(38) hi(x,r)= / t {lx _ yl < r} ip(y)dy = $(x + r) - ${x - r) , 

where $ is the c.d.f of a standard Gaussian random variable. Thus, in the 
case where D > 1/2, 

{y/E(U n (r)-U(r)),reI} 

converges weakly in T>(I), equipped with the topology of uniform conver- 
gence, to the zero mean Gaussian process {W(r),r £ /} with covariance 
structure given by 

(39) E[W{s)W{t)] = 4 Cov(/ii(Ai,s),/ii(Ai,t)) 

+ 4 £{Cov(fci (Ai , s), h x (A m , t)) + Cov(/ Jl (Ax ,t),h 1 (A m , *))} , 

where h\ is given in (38). If D < 1/2, with similar arguments as those 
used in the example on the Hodges-Lehmann estimator, the assumptions of 
Theorem 2 are satisfied with m = 2 and we get using (37), that 

{k{D)n D L{n)~ l {U n (r) - U{r));r G /} 
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converges weakly in T>(I), equipped with the topology of uniform conver- 
gence, to 



where I is any compact set of M which does not contain 0. Thus 

Proposition 7. In the long-memory case with 1/2 < D < 1, the asymp- 
totic behavior of the sample correlation integral U n (r) , r £ I is Gaussian with 
covariance (39). If < D < 1/2 and if I is a compact set in R which does 
not contain 0, then the limit is non-Gaussian and given in (40). 

4.4. Shamos scale estimator. Assume that (li)j>i satisfy Yi = aXi where 
(Xj)j>i satisfy Assumption (Al). The results of the previous subsection can 
be used to derive the properties of the estimator of the scale a proposed 
by Shamos (1976) and Bickel and Lehmann (1979). From Yi,...,Y n , it is 
defined by 

&bl = c median{|li— Yj\; l<i<j<n} = ca median{| X{— Xj\; 1 < i < j < 

where c = l/(\/2<]? _1 (l/4) « 1.0483 and $ is the c.d.f of a standard Gaus- 
sian random variable to achieve consistency for a in the case of Gaussian 
distribution, obl involves the median of the distance between observations. 
As is the case for the standard deviation, if the Yj's are transformed into 
aYi + b, then &bl is multiplied by \a\. Here G(x,y) = \x — y\, U(r) = 



f[$(x + r) - $(a - r)]<p(x)dx, U'(r) = 2j<p(x + r)(p(x)dx, U~ l {l/2) = 1/c 
and U'{U- l {\/2)) = U'(l/c) = >/2p(l/(c>/2)). By Corollary 3, we obtain 
that for D > 1/2, 



where W is a Gaussian process having the covariance structure (20) with h\ 
given in (38). Consider now the case D < 1/2. By (37), a 2 ,o(U~ 1 (l/2)) = 
— ai i i(f/~ 1 (l/2)) = —0:1,1(1/0) = <p(l/(cy/2)). Hence, we deduce from Corol- 
lary 4 that, if D < 1/2, 



(42) k(D)n D L(n)-\a BL - a) A gfi (^i,d(1) 2 - 2^(1)) 



(40) 



{<p(r/y/2)(Z2, D (l) ~ Zi, D {lf)\r 



el}. 



(41) 




-(Z 2 , (l)-Z liD (l) 2 ). 



a 
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Let us now compare the asymptotic behavior of the Shamos scale estima- 
tor with that of the square root of the sample variance estimator, <7 n ,y = 
(EILlPi " Y) 2 /(n - I)) 1 ' 2 . We have 

n 

n{n-l){al Y -a 2 ) = a 2 [nY J {X 2 -l)- £ Xtfj + n], 

i=l l<i,j<n 

so that by Lemma 15, 

k(D)n D L(ny\al Y - a 2 ) A a 2 (Z 2 , D (l) - Z^ D (lf) . 

We apply the Delta method to go from a 1 to a, setting f(x) = y/x, so that 
f'(a 2 ) = 1/(2^) = I /(2a). We obtain 

(43) k(D)n D L(n)~ l (a n ^Y - a) A |(Z 2j23 (l) - Z ljD (l) 2 ) . 

Thus, 

Proposition 8. In the long-memory case with 1/2 < D < 1, the asymp- 
totic behavior of the Shamos scale estimator &bl is Gaussian and given in 
(41)- If® < D < 1/2, it is non-Gaussian and given by (42); in this case, &bl 
converges to a at the same rate as the square root of the sample variance 
estimator with no loss of efficiency. 

5. Proofs of Theorems 1 and 2. 

5.1. Lemmas used in the proof of Theorem 1. 

Lemma 9. Under the assumptions of Theorem 1, the process {y/nW n (r),r E 
/}, where W n (-) is defined in (7), converges weakly in the space of cadlag 
functions equipped with the topology of uniform convergence to the zero mean 
Gaussian process {W(r),r G 1} with covariance structure given by 

E[W(s)W(t)] = 4 Cov(/ii (X x ,s),hi (Xi , t)) 

+ 4 £{Cov(fci [X x t8 )M (X i+1 ,t)) + Cov(/ii (Xx ,t)M (X l+l ,s))}. 

£>1 

The proof of Lemma 9 is in the supplemental article Levy-Leduc et al. 
(2010c). 
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Lemma 10. Under the assumptions of Theorem 1, 

sup y/n\R n (r)\ = o P (l) , 

where R n is defined in (8). 

The proof of Lemma 10 is in the supplemental article Levy-Leduc et al. 
(2010c). 

Lemma 11. Under the assumptions of Theorem 1, there exist positive 
constants C and a such that, for large enough n, 

(44) n{R n (t) - Rn(s)} 2 } < C , for all8,t€l, 

where R n is defined in (8). 

The proof of Lemma 11 can be found at the end of this subsection and is 
based on the following lemmas 12, 13 and 14. 

Lemma 12. Let f : M? — > R be a bounded function such that its deriva- 
tive (9 6 / /dx s dy 3 exists. Let (X,Y) be a standard Gaussian random vector. 
Assume that E {d i+j f{X,Y)/dx i dy j ) 2 < oo, for all 1 < i,j < 3, then 
the Hermite coefficients of f defined by c p>q (f) := K[f(X,Y)H p (X)H q (Y)] 
satisfy, for p,q>3 

(45) \c P M)\ < n(d 6 f(X,Y)/dx 3 dy 3 ) 2 ] 1 / 2 ^{p~^y. V '(^y. . 

The proof of Lemma 12 is in the supplemental article Levy-Leduc et al. 
(2010c). The following lemma is an extension of Corollary 2.1 in Soulier 
(2001) and is proved in the supplemental article Levy-Leduc et al. (2010c). 



Lemma 13. Let f\ and fa be two functions defined on W 11 ana 
respectively. Let T be the covariance matrix of the mean-zero Gaussian vector 
Y = (Y\,Y2) where Y\ andY2 are inM ai andW 12 , respectively. Assume that 
there exists a block diagonal matrix Tq of size (01 + 02) x (01 + 02) built from 
r with diagonal blocks Lo,i and Tq^ of size a\ x a\ and a<i x 02, respectively, 

. 1/2 1/2 

such that r* := ||L Q (Tq — L)L H2 < (1/3 — e), for some positive e. 
In the previous inequality \\B\\2 denotes the spectral radius of the symmetric 
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matrix B. If at least one function fi has an Hermite rank larger than t, then 
there exists a positive constant C{a\,a 2 ,e) such that 



\E\f 1 (Y 1 )f 2 {Y 2 )]\ < C'(ai,a 2 , e )||/i|| 2) r , 1 ||/2||2,ro, a (r*) [(T+1)/2] , 



(46) 

where \\fi\\lj . = (2vr)^/ 2 |r ,r 1/2 exp(-* T r o >/2)dx, i = 1,2 

and [x] denotes the integer part ofx. 

We shall use the following notation: for a Gaussian vector (Xi, X 2 , X 3 , X 4 ) 
with covariance matrix T and for any real- valued function of this vector, the 
expected value E[f(X h X 2 , X 3 ,X 4 )} will be denoted by E r [f(X 1 ,X 2 , X 3 ,X 4 )}. 

Lemma 14. Let (Xi, X 2 , X 3 , X 4 ) be a Gaussian vector with mean and 
covariance matrix 
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and let J a and Jb be functions from R 2 to R such that Er[J a (Xi, X 2 ) 2 ] < 00 
and Er[Jb(X 3 , X4) 2 ] < 00. Then, there is a Gaussian vector (X\, X 2 , X 3 , X 4 ) 
with mean and covariance matrix 
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Withp x? , = p 13 , PU = (PU-P13P34)/V 1 ~ P3A> P23 = (P23~ P12P13) I V 'l ~ Pl2> 

P24 + P12P13P34 — P12P1A — P23P3A 



P24 



V 1 ~ pfW 1 ~ Pl2 

If \Pij\ < P* for all i,j, then pij < p* for all i,j, where 

4p* 



(47) 



P 



1 - (p*)* ■ 

There are, moreover, functions J a and Jb such that 



E r [J a (X 1 ,X 2 )J b (X 3 ,X 4 )]=E r [J a (X 1 ,X 2 )J b (X 3 ,X 4 )} , 
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E r [J a (X 1 ,X 2 )]=E P [J a (X 1 ,X 2 )} , E r [J b (X 3 ,X i )]=E f ,[J b (X 3 ,X i )} , 
and 

E T [J a (X u X 2 ) 2 ]=E f [J a (X l ,X 2 ) 2 ) , E r [J b (X 3 ,X 4 ) 2 } = E r [J b (X 3 ,X 4 ) 2 } . 

If J a and J b are bounded, then J a and J b are bounded as well. 

The proof of Lemma 14 is in the supplemental article Levy-Leduc et al. 
(2010c). 

PROOF of Lemma 11. Note that R n {t) — R n (s) can be written as 

(48) R n (t)-R n (s) = 1 Yl J ( X i> X i), 
where 

(49) J(x,y) = J s ,t{x,y) = {h(x,y,t) - h(x,y,s)} - {hi(x,t) - /ii(x,s)} 

-{/ ll ( y ,t)-^ 1 (y, S )} + {[/(t)-C7( S )}. 

In the sequel, we shall drop for convenience the subscripts s and t. In view 
of the definition of h, hi and U in (2), (4) and (3) respectively, one has 

(50) ||J||oo<4, 

that is, J is bounded. Then, by Conditions (17) and (19), for any Gaussian 
vector (Xi, Xj, Xk, Xi), one has 

(51) E[\J(X h Xj)J{X k ,X t )\] <C-E.[\J{Xi,Xj)\] <C\t-s\ , 

for some positive constant C which may change from line to line. By the 
degeneracy of Hoeffding projections, expanding J into the basis of Hermite 
polynomials leads to: 

(52) J(x,y) = H P (x)H q (y) , with c , p = c p , = , Vp > , 

p,q>o P ' q ' 

where 

(53) c p , q (s, t) = E[J(X, Y)H p (X)H q (Y)} , 
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X and Y being independent standard Gaussian random variables. Therefore, 
using (51), 

(54) |cp, ? | < E[J(X, y) 2 ] 1/2 (p! ql) 1/2 < C(p! g!) 1/2 |t - s\ 1/2 . 

Remark that the sum in (52) is over p and q such that p + q > m, since the 
Hermite rank of J is greater than or equal to the Hermite rank of h. Using 
(48), we obtain that 
(55) 

E[{R n (t) -R n (s)} 2 } < 1 nJ(X h ,X i2 )J(X i3 ,X i4 )] . 

1 < l l 7^ l 2 — n 
l<J37^i4<n 

We shall consider 3 cases depending on the cardinality of the set {ii,i2,hiU}- 

1) We first address the case where i\ = 13 and 12 = i&- Using (51), we get 

nH ^_ 1)2 E nj(x n ,x l2 f]<^\t- s \, 

which is consistent with (44). 

2) Let us now consider the case where the cardinality of the set 12, 13, 14} 
is 3 and suppose without loss of generality that i± =13. Suppose also that 
p defined in Assumption (Al) has the following property: there exists some 
positive p* such that 

(56) \p(k)\ < p* < 1/13, for all k > 1 . 

If we apply the same arguments as in the previous case, we get a rate of order 
1/n instead of the desired rate l/n 1+a . To obtain the latter rate, we propose 
to approximate J by a smooth function J e using a convolution approach. 
More precisely, we define, for all x, y in R, 



(57) J £ (x,y) = J J(x — ez,y — Ez')(p(z)ip(z')dzdz' . 
Thus, 

(58) E[J(X ll ,X l2 )J(X ll ,X i4 )}=K[J £ (X il ,X l2 )J(X il ,X u )} 

+ E[(J-J e )(X il ,X ia )J(X il ,X i4 )] 
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Applying Lemma 12 to / = J £ and noting that, by Condition (15), ||<9 6 J e /cte 3 ch/ 3 
Ce~ 6 \t — s\ 1 / 2 , for some positive constant C, we obtain 

E[J £ (X n ,X i2 )J(X n ,X i4 )] 

< E [^T \3h^\ H {X , ll )J(X ll ,X i4 )H g (X l2 )\] 
p% Plql 

< Ce~ Q \t - s\ 1 ' 2 Y, W) -1 VG^W V^F3)! 

Pi9>3 

where c Pi9 ( J e ) is the (p, g)th Hermite coefficient of J e . We shall apply Lemma 
13 with Yi = (X tl ,X H ), Y 2 = X i2 , oi = 2, a 2 = 1, r ,i = Id, r , 2 = 1, 
fi = H p J and f 2 = H q . Observe that Id — V is a 3 x 3 matrix with p entries 
and hence 1 1 Id — r 1 1 2 < (01 + a2)||Id — r||oo = 3 1 1 Id — r||oo = 3/9*, where ||^4||oo 
is defined for a matrix A = (ciij)ij by ||^4||oo = maxjj \a^j\. Hence, by (56), 
the condition on r* of Lemma 13 is satisfied. Since J is bounded and f 2 is 
of Hermite rank larger than 2, Lemma 13 with [(2 + l)/2] = 1 implies that 
there exists a positive constant C such that 
(59) 

lElHpiXiJJiX^XiJHqiXiJ}] < Cy/^\(\p{U-h)Mp(i2-h)Mp(h-ix)d ■ 

Hence, 
(60) 

E^X^X^JpQ^XiJ] < Ce-^t-sl 1 / 2 (|p(i 4 -*2)|+|p(t2-ii)|+|p(»4-ti)|) • 

Since, for example, Ei<i 2 ^i 4 <n \p(u-i2)\ < nY,\k\<n l/°( fc )l> and since tnere 
exist positive constants C and 5 such that \p(k)\ < C(l A |/c|~ £)+<5 ), for all 
k > 1, we obtain that ^i<j 2 =^j 4 < n |p(»4 - »2)| < Cn 2 - D+<5 . Hence, 

(61) n2(re _ 1)2 £ E[^(^ i; ^ 2 )^n,^ 4 )]< J +D J . 

1 <i 4 7^2 < n 

I<ii7^i4<n 



We now focus on the last term in (58). By the Cauchy-Schwarz inequality 
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and (51), 

( 62 ) 2? 1 U 2 E E[(J- J £ )(X n ,X i2 )J(X n ,X J4 )] 
n z in — IV 



n 2 (n — iy 

l<ii7^24<n 



It -si 1 / 



^J^V^l) E E[(J-J £ ) 2 (X n ,X i2 )]V2 



Using (57), the Jensen's inequality and (50), 
(63) E[(J-J £ ) 2 (X H ,X i2 )] 

[J(x,y) - J(x - ez,y - ez')]<p(z)(p(z')dzdz' \ f iui2 (x,y)dxdy 



</ W [^(^2/) - J(x-ez,y-ez')] 2 f il>i2 (x,y)dxdy \ ip(z)y(z!)dzdz' 
Jm. 2 Ui 2 J 

< C / {/ l^(^,y)-«/(^-^,y-e2; / )|/i 1 ,i2( 2; ) y)dxdy}v9(z)^(2;')dzdz / , 
Jr 2 jr 2 

where is the p.d.f of (Xj 15 Xi 2 ). By (49), Conditions (16) and (18), 

(64) / {/ \J(x,y) - J(x - ez,y - £z')\f iu i 2 (x,y)dxdy}ip(z)ip(z')dzdz' 
Jr 2 Jr 2 

< Ce [ (\z\ + \z'\)ip(z)ip(z')dzdz' < Ce . 



Using (62), (63) and (64), we get 
(65) 

— — E n{J-Je){x^x l2 )j{x n ,x ti )]<c — 1 !— . 

n A [n — \y z — ' n 

l<ii^«4<n 

Note that (61) involves the factor e~ 6 and (65) involves the factor e 1 / 2 . By 
choosing e = e n = n~ u with < v < (D — 5)/6 in (61) and (65), we obtain 
a result consistent with (44). 

If Condition (56) is not satisfied then let r be such that p{k) < p* < 1/13, 
for all k > r. In the case where, for instance, \i% — ij| < r then, using that 
J is bounded, Conditions (17) and (19), we get that 

Y E[J(X il ,X i2 )J(X il) X i4 )] < C- H 



n^(n — 1)^ z — ' ra z 

l<Zl^24<n,|22— H|<T 
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instead of (61), but the result is still consistent with (44). The same result 
holds when \i± — i±\ < r or \i% — %2\ < r. Note also that the remaining sum 
over the indices such that \i\ — %2\ > r, \ii — h\ > t and \i2 — h\ > t can be 
addressed in the same way as when Condition (56) is satisfied. 

3) Now, we assume that the cardinal number of the set {i\ , 12 , ^3 , 14} equals 
4 and that Condition (56) holds. By Lemma 14, 

(66) E[J(Xi 1 ,Xi 2 )J(Xi 3 ,Xi 4 )] =Ef[Ji l! i 2 (Xi 1 ,Xi 2 )Ji 3t i 4 (Xi 3 ,Xi 4 )] . 

Here (X^ , Xi 2 , Xi 3 , X^ 4 ) is a Gaussian vector with mean and covariance 
matrix T defined in Lemma 14 where pij = p{i — j), J a = Jb = J, J a = 
Ji lt i 2 and Jfe = Ji 3) i 4 - Since the covariance of (X^jX^) and (Xi 3 ,Xi 4 ) 
is the identity matrix, we can expand Jj li j 2 (Xj 1 ,Xj 2 ) and Ji 3 ^ 4 (Xi 3 , Xi 4 ). 
Jij ) i 2 (Xi x , Xi 2 ) is the limit in L 2 , as K — > 00, of 



K 



£1,12 



(67) j£ i (Xj 15 X i2 ) — V] , Pl f 2 , H Pl (X h )H P2 (X i2 ) , 

with a similar expansion for J^ i4 (Xi 3 , Xi 4 ). Therefore, 
(68) 

Jim Ep[Jj 1) i 2 (Xi 1 ,Xi 2 )Ji 3j i 4 (Xi3,Xi 4 )-J i j Xjj)^ j (Xi 3 ,X i4 )] = . 
Thus it is enough to majorize 
(69) E r [ J5 2 (X n , X,, ) j£ i4 ( , X iA )] 

< E E |Er[^^i)^(^)g w (x 3 )g w (x 4 )]| 

By Lemma 3.2 in Taqqu (1977), E r [^ pl (X il )if P2 (X i2 )^ P3 (A > i3 )F P4 (X i4 )] is 
zero if pi + • • • + P4 is odd. Otherwise it is bounded by a constant times 
a sum of products of (pi + • • • + Pa)/2 covariances. These will be denoted 
pi j = E(XiXj) and are given in Lemma 14. Since p(k) < p* < 1/13, we 
have that p itj < p* < 1/3, where p* = 4p*/(l - (p*) 2 ) by (47). Bounding, 
in each product of covariances, all the covariances but two, by p* < 1/3, we 
get that 

E r [F Pl (X n ) J ff P2 (A > i2 ) J ff P 3(A > i3 )F P4 (A > i4 )] is bounded by 
(70) 

C (3p*) Pl +P2 T" +PA - 2 A(h, i 2 ,i 3 ,u)\E[H pi (X)H P2 {X)H P3 (X)H P4 (X)] | , 
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where, since p ilji2 = p isM = 0, 



(71) A{i\, 12-, is, 24) — Pi\,izPi%,i\ ~l~ Pi2,izPil,ii Pii, izPiz,iz Pii,i&Pi2,i& 

Pii,izPi\M Pi2,hPi2,U i 

and where X is a standard Gaussian random variable. Note also that the 
hypercontractivity Lemma 3.1 in Taqqu (1977) yields 

P1+P2+P3+P4 



(72) \E[H P1 (X)H P2 (X)H P3 (X)H P4 (X)}\ < 3 a y/ Pl \ p 2 \ p 3 \ Pi \ 

Thus (69) is bounded by 

ca ( £ J^^^A ( y: -^m*^- 1 

\l< P ^<K J \l< P ^<K 

By the Cauchy-Schwarz inequality, the first term in brackets is bounded by 

1/2 / ^ 1/2 

(JlM \2 ' 



< 73 > E » E 

\1<P1,P2<-K Fi '' P -" / \l<Pl,P2</< 

<Ei [J il>i2 (X h ,X i2 ) 2 ] 1/2 [^(Sp*)*- 1 

\p>i 

where I is the identity matrix and similarly for the second term. Since p* < 
1/3, it follows from Lemma 14 that (69) is bounded by 

(74) CA Ei [Ji^X^X^)] 1 ' 2 ^ [j i3ti4 (X i3 ,X u )] 1/2 

= CAE ril [j(X h ,X i2 ) 2 } 1/2 Er 22 [J(X ia , X l4 ) 2 ] 1/2 < CA\t - s\ , 

where we used (51) and the fact that J is bounded. Thus, in wiew of (66), 
(68), (69) and (74), we have 

( 75 ) 2 ( 1 u2 E nj(x h ,x i2 )j(x i3 ,x u )} 

n z [n — l) z ^— ' 

1 >*2 >*3 '*4 — 71 



|{u,22,*3,m}|=4 



- C ^2l 7\2 E ^(*l>*2,i3,*4) 

n z (n — 1 r ' 

1^*1)*2>*3>M — n 
|{il,*2,t3jU}|=4 



U-PROCESSES UNDER LONG-RANGE DEPENDENCE 



27 



We need to evaluate that sum. Recall that A = A(i%, ii-, 13, k) is defined in 
(71) with the pij defined in Lemma 14. We shall treat one summand in A 
(the others are treated in the same way). We have 

Ph,nPi2,i4 < Cp(h - h)[p{h - k) + p{k - k) + p{h ~ k) + Pih - k)] ■ 

Using that there exist positive constants C (changing from line to line) and 
e such that \p(k)\ < C(l A \k\- D+£ ), for all k > 1, we get 



\k\<n 

< Cn A - 2D+2£ , 



1<»1,»2i*3i»4<*» l<ii^l2<n |fc|<n 

|{il,i 2 ,i3,i4}|=4 



and 



|{il,i2,«3,*4}|=4 

71 ?>2 — 1 Tl Tl 

= »E(E^^)+ E p(n-, 2 )) 2 <Cn^,r 2 ^<Cn 4 - 2 ^. 

12=1 il=l «i=l2 + l «2 = 1 

Therefore, Relation (75) is bounded by Cn~ 2D+2e \t — s\, which is a result 
consistent with (44) with 2e = D - 1/2 > 0. 

If Condition (56) is not satisfied, then let r be such that 

(76) sup p(k) < i-(l- sup . 

In the case where, for instance, \i± — < r and \%2 — k\ < r, there is no 
need to use (76) because we get using (51), 

(77) n 2(n - l)^ E E[J(X tl ,^ 2 )J(X, 3 ,X i4 )] 

1 ,*2 >M — n 



{«l,«2,*3,«4}|=4,|u-i3l<T,l*2-i4|<T 

< c 



T 2 |t - S| 

n 2 



which is consistent with (44). In the case where, for instance, is\ < r and 
the other distances are larger than r, we apply the same method as in 2). 
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What changes is the block diagonal matrix Tq involved in Lemma 13. In fact, 
to evaluate E[ J e (X^ , Xi 2 ) J(Xi 3 , Xi i )] we expand J £ in Hermite polynomials, 
so that we need to control ¥,[H p {Xi 1 )J{Xi 3 , Xi 4 )H q (Xi 2 )]. We want to apply 
Lemma 13 with Y l = (X h , X ia , X u ), Y 2 = X h , h = H p J, f 2 = H q . We let 
ro,i be a 3 x 3 block diagonal matrix with a first block corresponding to 
the covariance matrix of the vector (X^ , Xi 3 ) and a second block equal to 
1, and we let Tq^ = 1, so that To is a 4 x 4 matrix. Observe that ||Tq 1 (r — 

r )|| 2 < 4||ro 1 (r - r )||oo, where nr^cr - r )||oo < (su Pl < fc < T/0 (A;))(i - 

sup fc>T p(fe))- 1 < 1/13, by (76). Thus, ||r„ \T - T )h < 4/13 < 1/3-r/, for 
some positive r). Hence, the condition on r* of Lemma 13 is satisfied. The 
remaining sum over indices where the distances between any two indices are 
larger than r can be addressed in the same way as when Condition (56) is 
satisfied. □ 

5.2. Lemmas used in the proof of Theorem 2. 
The following lemma proves joint convergence and provides the joint cumu- 
lants of the limits (Z 2 ,d(1), (^i,d(1)) 2 )- 

Lemma 15. Let (Xj)j>\ be a stationary process satisfying Assumption 
(Al) with D < 1/2 and let a and b be two real constants. Then, as n tends 
to infinity, 



k{D) 



n D ~ 2 
Lin] 



n 

an^Xf - 1) + b X i X i ) [« Z 2,d(1) + b{Z ljD (l)f 

i=l l<€,j'<n 



where — > denotes the convergence in distribution, k(D) = B((l — D)/2,D) 
where B denotes the Beta function, Zi t u(-) and ^2,d( - ) ore defined in (24) 
and (25) respectively. The cumulants of the limit process are given in (89). 

The proof of Lemma 15 is in the supplemental article Levy-Leduc et al. 
(2010c). 

Remark 5. It follows from Lemma 15 that E[Z^ ) i(l) 2 ] = a 2 where a 2 
is given in (92). Moreover, setting a = 1, b = in (89), we get the expression 
(30) for E[Z Di2 (l) 2 ]. 



U-PROCESSES UNDER LONG-RANGE DEPENDENCE 



29 



Lemma 16. Under Assumption (Al) there exists a positive constant C 
such that, for n large enough, 

D—2 n 

(78) ™ E[{ V Xi} 2 } < C , when D < 1 , 
L(n) ^ 

2D— 2 n 

(79) ?L_^[{£(X 2 - l)} 2 ] < C , when D < 1/2 , 

i=i 

and 

( 8 °) T7-^ E [{ y WjY] < C , when D < 1/2 . 

The proof of Lemma 16 is in the supplemental article Levy-Leduc et al. 
(2010c). 

Lemma 17. Suppose that the assumptions of Theorem 2 hold, in partic- 
ular D < 1/m where m = 1 or 2. Then, R n defined in (21) satisfies the 
following property. There exist positive constants a and C such that, for n 
large enough, 

(81) al E[(R n (t) - R n (s)) 2 } < , for all s,t € I , 

where I is any compact interval ofM. and a n = n mD l 2 ~ 2 L{ri)~ m l 2 . 

The proof of Lemma 17 is in the supplemental article Levy-Leduc et al. 
(2010c). 

Lemma 18. Under the assumptions of Theorem 2, R n defined in (21) 
satisfies, as n tends to infinity, 

supa n |i? n (r)| = op(l) , 

where a n = n~ 2+mD / 2 L(n)~ m l 2 , and m is the Hermite rank of the class of 
functions {h(-, -,r) — U(r) , r G 1} which is equal to 1 or 2. 

The proof of Lemma 18 is in the supplemental article Levy-Leduc et al. 
(2010c). 
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5.3. Proof of Theorem 2. Consider the decomposition (21). Since G(x, y) = 
G(y,x), one has ai t o(r) — a o,i( r )i ^.oW — &o,2(,r) and W n defined in (22) 
satisfies 

n 

(82) W n (r) = 2(n - l)ai, (r) ^TXi , if m = 1 , 

i=i 

n 

(83) W n (r) = ai,i(r) ^ + (n- l)a 2)0 (r) ^(^ 2 - 1) , if m = 2 . 

l<ij^j<n i=l 

If m = 1, using Lemma 5.1 in Taqqu (1975), if r is fixed, n Z3 ' /2 ~ 2 L(n)~ 1//2 Vl / n (r) 
defined in (82) converges in distribution to 2k(D)~ 1 ^ 2 ai^(r)Zi j £>(^)- Then, 
by the Cramer- Wold device, if r\, . . . ,r k are fixed real numbers, 
k(D) l / 2 n D l 2 ~ 2 L(n)~ 1 / 2 (W n (ri) , . . . , W n (r k )) converges in distribution to 
(2a lt0 (r 1 )Z 1)D (l), ■ ■ ■ ,2a 1)0 (r k )Z 1)D (l)). In the same way, if m = 2, using 
Lemma 15 in Section 5.2 and the Cramer- Wold device, 
k(D)n D ~ 2 L(n)~ 1 (W n (ri), . . . , W n (r k )) converges in distribution to 
(ai,i(r 1 )(Z 1 , D (l)) 2 +a 2i0 (r 1 )Z 2iD (l), . . . J a lil (r fc )(Z liD (l)) 2 +a 2 , (r fc )Z 2iD (l)). 
We now show that {n mD l 2 ~ 2 L(n)~ m l 2 W n (r)]r G 1} is tight in V(I). We 
shall do it in the case m = 1. By (105), Lemma 16 in Section 5.2 and 
the fact that A is a bounded Lipschitz function, we get that there exists a 
positive constant C such that for all r\ < r 2 in I, 

(n D / 2 - 2 L( ? i)- 1 /2 ) 2 ]E[{ ^ n(r2) _^ n(ri)} 2 ] < C( A (r2 )_A( ri )) 2 < C\r 2 - n \ 2 . 

Using the Cauchy-Schwarz inequality, we obtain that for all n, r 2 , r3 in /, 
such that r\ < r 2 < r$, 

W n (r 3 )-W n (r 2 )\ 
< C\r 2 - ri\\r 3 - r 2 | < C|r 3 - n| 2 . 

The tightness then follows from Theorem 15.6 of Billingsley (1968). A sim- 
ilar argument holds for m = 2. Thus, {n mD / 2 ~ 2 L(n)~ m / 2 W n (r);r G /} 
converges weakly to {2a\fi{r)k{D)~ 1 / 2 Z\^£){1)] r G /}, if m = 1 and to 
{fc(D)- 1 [ai 5l (r)Zi iD (l) 2 + a 2 , (r)Z 2iD (l)] ;r G /}, if m = 2. To complete 
the proof of Theorem 2 use (21) and Lemma 18 in Section 5.2, which ensures 
that sup rg7 n mD / 2 - 2 L(n)~ m l 2 \R n (r)\ = op(l), as n tends to infinity. 



(n D / 2 ~ 2 L(n)~ 1 / 2 ) 2 E W n (r 2 ) - W n {n) 
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SUPPLEMENTARY MATERIAL 

Proofs of Lemmas 9, 10, 12, 13, 14, 15, 16, 17 and 18 and some 
numerical experiments. 

(http://lib.stat. emu. edu/aoas/???/???). This supplement contains proofs of 
Lemmas 9, 10, 12, 13, 14, 15, 16, 17 and 18 and a section containing numer- 
ical experiments illustrating some results of Section 4. 
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Supplement to paper "Asymptotic properties of U-processes 
under long-range dependence" 

This supplement contains proofs of Lemmas 9, 10, 12, 13, 14, 15, 16, 17 
and 18 and a section containing numerical experiments illustrating some 
results of Section 4. 

Proof of Lemma 9. Let us check that the assumptions of Theorem 9 
in Arcones (1994) hold for the class J- of functions {h\(-,r) : r G 1} which 
is of rank r > m = 2 > 1/D. By Assumption (Al) and since r > 1/D, the 
condition (i") of Theorem 9 in Arcones (1994) is satisfied. We conclude the 
proof of the Lemma by observing that the condition (ii) of this Theorem is 
also fulfilled. To check this condition, we have to prove that 



302. 



1 80-83. 
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where \ (e, T) is the bracketing number of the class T as defined on page 
2269 in Arcones (1994): 

N, Y (e, T) = mm.{N : 3 measurable functions fi, ■ ■ ■ , /n and A l5 . . . , Ajy 
such that for each / G J 7 , 3i < N such that |/j — /| < Aj and where 

E(Af(X)) < e 2 for each i < N} . 

Let = 0, . . . , N} be such that: for all i, |rj — rj_i| < e/C and for all 
r £ I, there exists i such that \r — rj| < e. The smallest ./V satisfying this 
property is at most equal to [|/|C/e] + l, where |/| denotes the length of /. Let 
us define for all i> 1, ft = h\(-, n) and Aj = h\(-, ri) — hi(-, r,j_i). Using (19) 
we first g etE(A?pQ) = E({h l (X,r i )-h l (X,r i ^ 1 )} 2 ) < C^r.-r^ 2 < e 2 . 
Now, let r G I and 1 < i < N be such that, r,_i < r < rj. Then, using 
the fact that h\ is increasing with respect to its second argument leads to 
\fi ~ hi(-,r)\ < Aj. Thus, N^ 2 ? (e, J 7 ) < [\I\C/e] + 1 which yields condition 
(ii) of Theorem 9 in Arcones (1994). □ 

PROOF of Lemma 10. We want to apply Lemma 5.2, P. 4307 of 
Borovkova, Burton and Dehling (2001) to {y/nR n (r),r G /}. To do this, we 
prove that for all s, t G /, 6 > such that s < t < s + 5 and s + 5 G /: 

(84) V^\Rn(t) ~ Rn(s)\ 

< sfr\R n {s+5)-R n {s)\ + 2^\W n {s+5)-W n {s)\+A^i\U{s+5)-U{s)\ . 

Using the definition of R n given by (8) and the fact that h, h± and U are 
nondecreasing functions with respect to r, we get: 

R n (t)-R n (s) < \ Yl {hiXitXjrf-hiXiiX^afi+iUW-Uia)} 

< , 1 7T V {h(X i ,X J ,s + S)-h(X u X j ,s)} + {U(s + 5)-U(s)} . 
nin — 1) ^-^ 

By adding and subtracting functions h\ evaluated at s + 5 and s, we obtain: 



2 n 



R n (t) - R n (s) < {R n (s + S)-Rn(s)} + - Y,{hi{Xh s + 8) - hi(Xi, s)} . 



n . 
i=i 



Adding and subtracting 2(U (s) — U(s + 5)) leads to: 

R n (t)-R n (s) < {R n ( s+ 5)-R n (s)}+{W n (s+5)-W n (s)}+2{U(s+5)-U(s)} 
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where W n is defined in (7). Thus, 
(85) 

R n {t)-R n {s) < \R n {s+5)-R n {s)\ + \W n {s+5)-W n {s)\ + 2\U{s+5)-U{s)\ . 

Let us now find an upper bound for R n {s) — R n (t). Starting with the expres- 
sion (8) for R n (r) and setting h(Xi,Xj,s) < h(Xi,Xj,s+S), U(s) < U(s+5) 
and h(Xi,Xj,t) > h(Xi,Xj,s), U(t) > U(s) since h and U are non decreas- 
ing functions with respect to r, we obtain 

R n (s)-R n (t)< 1 V [{h{X i ,X j ,s + S)-h{X i ,X j ,s)} 
n(n — 1) 

l<i^j<n 

- 2{h 1 (X i , s) - htiX^t)}} + {U(s + 5)- U(s)} 

I L \ I i 1 I 

-2{7n(X i ,a + <y)-7ii(X i ,a)}] 

+ p 2 rr V [{fei (Xi,t)-fci(Xi, «)} + {/* s + 5) ->n 
n(n — 1) 

+ + 5) - 17(a)} 
< {i? n ( S + 5)- J R n ( S )} + - V {/np^s + ^-^X;,*)}. 

n ^ — * 



n 

Ki<n 



Adding and subtracting 4(U(s) — U(s + 8)) leads to: 
(86) 

R n (s)-R n {t) < \R n (s+5)-R n (s)\+2\W n {s+5)-W n (s)\+4\U{s+5)-U{s)\ 

Combining (85) and (86), we get for all s,t 6 I, S > such that s < t < s + 5 
and s + 5 £ I: 

V7i\R n (t) - R n {s)\ < V^\R n (s + 6)- R n (s)\ + 2^\W n (s + 5)- W n (s)\ 

+ iV^\U(s + 5)-U(s)\ , 

which is (84). Remark that U is Lipschitz by Condition (19). In Lemma 5.2, 
P. 4307 of Borovkova, Burton and Dehling (2001), the monotone Lipschitz- 
continuous function A is here U, a = 1/2 and the process {Y n (t)} is here 
{y/nW n (t)}. We shall now verify that conditions (i) and (ii) of that lemma 
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are satisfied. Condition (i) holds because of Lemma If. Condition (ii) in- 
volves {y/nW n (t)}. Applying inequality (2.43) of Theorem 4 in Arcones 
(1994) to /(•) = (7n(.,i) - hx(-,s)) - (U(t) - U(s)), which is, by (14), of 
Hermite rank r > 2 > 1/D, we get using (19) that there exist some positive 
constants C and C' such that: 



E[\^i{W n (t)-W n (s)}\ 2 
2 



E 



< CE 



-= J2(hi(Xi,t) - h^Xu s)) - (U(t) - U(s)) 
{ vn i=i 

{(hiXut) - h^s)) - (U(t) - U(s))} 2 



< C It 



Thus condition (ii) of Lemma 5.2 in Borovkova, Burton and Dehling (2001) 
is satisfied with r = 2 and monotone function g(t) = t. An application of 
this lemma concludes the proof. □ 

Proof of Lemma 12. Using that, for n > 1, (H n _np)' = —H n ip, where 
' denotes the first derivative, and 6 integrations by parts (3 with respect to 
x and 3 with respect to y), we get that for p, q > 3, 



c P M) 



d b f(x,y)/dx i dy s H q _ 3 (y)cp(y)H p _ 3 (x)<p(x)dxdy 



where ip is the p.d.f of a standard Gaussian random variable. (45) then 
follows from the Cauchy-Schwarz inequality. □ 

Proof of Lemma 13. Let / be a function defined on R a i+ a 2 suc h that 
f(Y) = f(Y l ,Y 2 ) = h(Y l )h{Y 2 ). Note that E[fjY)] = E[/(rJ /2 Z)], where 
the covariance matrix of Z is equal to r o 1 ^ 2 rr o 1 ^ 2 . By the assumption on 
r*, the latter matrix is invertible and satisfies: 



(r; 



-1/2 



rr; 



-l/2x-l 



[i ai - 



T)T 1/2 ) k . Let A := (T, 
of the multivariate Gaussian distribution and the definition of the matrix 
A, we obtain that 



02 M) 



1/2, 



-r)r- 1/2 ]~ 1 = / Ql+ a 2 +E fc >i(ro 1/2 (r c 

I ai +a 2 - By definition of the density 



-1/2 



rr, 



-l/2wl 



1/2 


1/2 



f(T Q ' z) expi-z 1 Az/2) exp(-z J z/2) 



dz 



( 27r )(ai+a 2 )/2 • 
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Expanding exp(— z T A.z/2) in series leads to 

|r 1/2 rro 1/2 | 1/2 iE[/(ro /2 ^)] 

( 2vr )(ai+a 2 )/2 • 



£ t Hr~ I f^z^Azf exp(-,^/2) 

k>0 JK 



Set v = [(r + l)/2], where [x] denotes the integer part of x. Using that / is 
of Hermite rank at least r and the previous equation, we get 

|r~ 1/2 IT~ 1/2 | x / 2 E[f(rl /2 z)} 



£ t TT- I f(rl /2 z)(z T Az) k exp(-/V2) 



( 2vr )(ai+a 2 )/2 • 



Since | E fe >^(-l/2) fc (^ T A^) fe /A;!| < \z T Az\" exp(|z T Az|/2)/(2^!), we ob- 
tain 

|r - 1/2 rr - 1/2 | 1/2 |E[/(r; /2 z)]| 

£ 2^ jL. ' /<r;/22)l l^^l- -Pfl^l/*) exp(- Z ^/2) (2;r)(a d ^ a)/2 
Denoting by 5 the spectral radius of A gives 

| r -i/2 rr -i/2|V2 | E [/(ry 2 z)]| 



< T / l/( r o z)\(z 1 z) u exp{(5/2 - l/2)z 1 z}- 

By the Cauchy-Schwarz inequality, we get 

(87) |ro 1/2 rr- 1/2 |V2 |E[/(rfz)]| 

dz \ 1 ' 2 



< 



( [ (z T z) 2 »e W {(8-l/2)z T z} , 



1/2 



By definition of A, the spectral radius S of A satisfies 5 < /Cfc>i( r *) fc — 
r*/(l — r*), where r* is the spectral radius of T 1//2 (To — T)r o 1 ^ 2 . By as- 
sumption on r*, 5 < 1/2 — 3e/2 which implies that the second integral in 
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(87) is convergent. The first integral in (87) satisfies 

dz N l ' 2 



||/i||2,r ,ill/: 



2||2,r ,2 



Finally, under the assumption on r*, the spectral radius 5$ of 



(r^nV'T 1 = E^o-fiV^ro-r^T satisfies s < £ 

l/(l-r*),sothat |ro 1/2 rro 1/2 |~ 1/2 < (l-r*)"( ai+a2 )/ 2 < (2/3+e)"( ai+a2 )/ 2 < 
(3/2)( ai+a2 )/ 2 . This establishes (46). □ 

Proof of Lemma 14. Let I denote the 2x2 identity matrix. One can 
express Tn as Tn = L a L^, where T denotes the transpose so that the vector 
(Xi,X2) T = L~ 1 (Xi, X2) 7, has covariance matrix fn = L~ 1 Tn(L~ 1 ) T = I. 
Similarly, T22 = L Lj so that (X^,X^) T = L b ~ 1 (X 3 , X±) T has covariance 
matrix T22 = L Then 

E r [J a (X 1 ,X 2 )J b (X 3 ,X 4 )} =E r [J a (X 1 ,X2)J b (X 3 ,X i )} , 

where J a = J a L a , J b = J b ° L b and 



-1/2n-1 



-1/2 



1/2-ifc 



k>0\ 



I 



L: 







l; 



rii r 12 

^21 ^22 



1\T 







(L b r 



1\T 



Observe that 



P12 V 1 -P12 



> K 



-1 



P12/V 1-^12 VV 1 -Pl2 



since L a L^ = Tn. A similar expression holds for L b with p±2 replaced by 
P34. Observe that 



r i2 — L a 1 ri 2 (L i 



-1\T 



-P12/V 1 - P12 V\A - P12 





P13 


P14 






P23 


P24 





1 -^/ yi - P 34 
l/vr^ 

Pl3 P14 
P23 P24 
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where the pij are given in the statement of the lemma. This characterizes 
the matrix T since f n = T22 = I and f 21 = f^ 2 - The relations involving p* 
and p* follow from those relating the p^-'s to the pki's. Finally, one has 

E l [J a (X 1 ,X 2 ) 2 ]=M Tn [J a (X 1 ,X 2 ) 2 } , 

and also the other similar type relations. □ 

Proof of Lemma 15. We first prove that, for p > 2, the pth cumulant 
k p of ciZ2 ! d(1) + b(Zi : £)(l)) 2 is equal to 

(89) K p = 2 p - 1 (p-l)\ k(D) p f d«i...d«p / d Vl ...dv p 

P 

~J[a5(v,j — Vj) + b] \uj — Vj-i\~ D , with vq = v p , 
i=i 

where k(D) = B((l - D)/2,D), 5(x) = 1 if a? = 0, and $(s) = else. Using 
(24) and (25), 

aZ %D (l) + fe(Z ljD (l)) 2 = / K(x, y)dB(x)dB(y) + 6a 2 , 

where 

ET(x,y)= / / [aS(u-v) + b](u - x)~ iD+1)/2 (v - y)~ {D+1)/2 dudv , 
and 

/• /•! 0+1 

a 2 = S[Z 1|£> (l) 3 ]= /[/ (u-x), 2 d«] 2 dx 



Jo 








1 /■! 



J 



0+1 



(it — a;) 1 2 (v — x) , 2 dccjdudi) . 



Using that for < a < 1/2, 

p poo 

(90) / (u - x)%-\v - x) a + ~ l dx = \u- v\ 2a ~ l / y a -\l + y) a ^dy 
Jr Jo 

= |n-i;| 2a - 1 B(a,-2a + l) , 
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where B(-, •) denotes the Beta function, we get 

(91) [(n-x)^ D+1 ^(v-x)^ D+1 ^dx 

= B((1- D)/2,D)\u-v\- D = k{D)\u-v\~ D . 

Thus, 

(92 , ^P)f(i.-.r» u ^ F A_. 

Hence, using Proposition 4.2 in Fox and Taqqu (1987), we have that for 

JRP 

By definition of K, and with the convention x p+ \ = x%, 

i K(xi,x 2 )K(x 2 ,x 3 ) . . . K(x p -i,Xp)K(xp, xi)dx 1 . . . dx p 

J MP 

= / dui . . . du p / dv± . . . dvp / dxi . . . dx p 

J\QA]P J\0,1]P JRP 



H[a5( Uj - Vj ) + b]{uj - x 3 y + {D+1),2 { Vj - x i+1 )~( D+1 )/ 2 



du\ . . . dup I dv\ . . . dvp 

[o,i]p ^[o,i]p 



f\[a5{ Uj - v s ) + b] [ ( Uj - Xj r + {D+ ^\ Vj ^ - Xj )- {D+1)/2 
j= i 

where vq = v p since Xj is associated with Uj and Vj—\. Using (91), we obtain 
the expression (89) for the cumulants K p , p > 2. Let us now compute the 
limit as n tends to infinity of the cumulants of 



n D ~ 2 r „„ . n n D ~ 2 



A n = —— \X'(anl + bll')X] = -— X'Y}I 2 (aril + 611') T}' 2 X 
L[n) L(n) L J . 

where X = (X\, . . . , X n )', 1 = (1, . . . , 1)', I is the n x n identity matrix, S 
is the covariance matrix of X and X is a standard Gaussian random vector. 
Using Stuart and Ord (1987), p. 488, the pth cumulant of A n is equal to 

cum p = 2 p - 1 (p-l)\ Tr{BP) , 
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where B n = n D ~ 2 L{n)~ l [S 1 / 2 {anl + bll') S 1 / 2 ] . But 

/ D~2\P 
n D-2\P 

JjjTj) Z2 D h,jiP(h - i2)D i2j2 p(j 



2 - «3j 



D i P -id P -ip(jp-i ~ v)A pJp p(i P - k) , 

where p is defined in Assumption (Al) and Dij = an5(i — j) + b. With the 
convention i p+ \ = i±, 



l<i^ ,12 ,. . . ,ip<n £=1 

l<31,32,—jp<n 

P ( „D 



an5(ii - j £ ) + b\p(jt - 



t Yl f[ { Jjrfj -3t) + b]p(je-i - it) | 



l<il,i2.---.-Sp<n £=1 

l<iiJ2!-Jp<n 



where jo = jp- Thus, as n tends to infinity, 



/ dui . . . du p / dvi . . . dv p 

J\o,l]p J[0A] v 

P r 

I [a8(uj — Vj) + b] / \uj — Vj. 

7 i 



i-£) 
1 > 



with the convention vo = v p , which gives the expected result. □ 

Proof of Lemma 16. By Assumption (Al), p(k) = k~ D L{k). Using the 
adaptation of Karamata's theorem given in Taqqu (1975), one gets 



(93) if D < 1/m, £ \p(k)\ m ~ -n 1 ""^^))™ . 

^— ' (1 — mL>){2 — mu) 



\k\<n 

The bound (78) follows from 



E[{]Tx t } 2 ]<(n + £ | p (<_ i )|)<n(l+ £ |p(fc)|), 
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and by setting m = 1 in (93). Let us prove (79). Given that K[H p (Xi)H q (Xj)] = 
p\ 5{p — q)p(i — j) p , for all integers p, q, i,j > 1, we obtain 

E[{f> 4 2 - I)} 2 ] = E[ HiiXJHiiXi)] 

1=1 

= 2n + 2 P(i-i) 2 <2n(l+ ]T p{kf) . 

The bound (79) follows by using that D < 1/2 and (93) with m = 2. Let us 
now prove (80). Note that 

(94) E[( Y XX j f} = E HWjXkXi) 

l<k^£<n 

= HXfXj)+ Y H^iXjXkX^+Q Y nxfX,X e ). 

l<i^j<n l<i,j,k,l<n l<i,j,l<n 

\{i,j,k,t}\=A \{i,3,l}\=Z 

Writing Xf = H 2 (Xi) + 1 and using that E[H 2 (X i )H 2 {X j )] = 2p(i-j) 2 , for 
all hj > L the first term in the r.h.s of (94) satisfies 

Y E(X?X])<n 2 + 2n Y ■ 

Using Lemma 3.2 P. 210 in Taqqu (1977), the second term in the r.h.s of 

(94) satisfies, for some positive constant C, 

Y nxiXjX k x t ) < cn\ y \p( k )\f ■ 

i<i,j,k,e<n \k\<n,k^0 

\{i,j,k,e}\=A 

Writing X? = H 2 (Xi) + 1 and using Lemma 3.2 P. 210 in Taqqu (1977), the 
third term in the r.h.s of (94) satisfies, for some positive constant C, 

Y E(XfXjX t ) < Cn{ Y \p{k)\? +n 2 Y \P^ k )\ ■ 

l<i>j.^<™ |fc|<n |fc|<n 

|{i,i/}|=3 

The last three inequalities lead to the expected result by using (93). □ 

Proof of Lemma 17. Set a Piq (s,t) = a Ptq (t) - a PA (s) for all s,t in R, 
where a p>q {-) is defined in (10). Then 

(95) K[(R n (t) - R n (s)) 2 } = Y nJs,t{ X hi X i2)JsA X i 3 ,X i4 )], 

1 7^2 — 71 

l<i3/«4<n 
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where for all x, y in R and s, t in /, 



(96) J Sjt (x, y) = (h(x, y, t) - h(x, y, s)) 

(97) J s , t {x, y) = {h(x, y, t) - h(x, y, s)) 

~ \i a 2fi{t) - a 2 ,o(s))(x 2 + y 2 



(a 1)0 (t) - a 1>0 (s))(x + y) 

-(U(t)-U(s)) , if m= 1 , 

(«l,l(t) - a^i(s))xy 

2) - (U(t) - U{s)) , if m = 2 . 



To obtain these relations express R n (t) — R n (s) using (21), (5) and (22). We 
now consider 3 cases, depending on the cardinality of the set {ii, 12, 13, 14}- 

1) We start with the case of cardinality 2. Let us address the case where 
the sum is over the set of indices (3, 14} such that i\ = 13 and i 2 = U- 
We shall only focus on the case where m = 1 because the case m = 2 could 
be addressed in the same way. We thus need to show that 

(98) mR n {t) - R n (s)) 2 } < Cn 4 - D ~ a \t - s\ . 

Using that h, U and ai t o are bounded functions, there exists a positive 
constant C such that 

+|ai >0 (t) - a li0 (s)\{(X h + X i2 f + \X h + X i2 \} + \U(t) - U(s)\) . 

Since Condition (26) holds and U, A, defined in (27) are Lipschitz functions, 
there exist positive constants C\ and C 2 such that 

Y nJi t (x H ,x l2 )] 

l<il^i2<n 

< dn(n - l)\t - s\ + C 2 \t - s\ Y ^[(Xi, + X i2 ) 2 + \X i2 \ + \X i2 \] 

l<il^i2<n 

< Cn(n - \)\t - s\ , 

which gives (98). 

2) Let us now address the case where the sum is over the set of indices 
{^l) ili *3> h} having a cardinal number equal to 3 i.e. for instance when 
13 = i±. As previously, we focus on the case where m = 1. Using that h, U 
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and ct\fi are bounded functions, Condition (26), the Lipschitz property of U 
and A, there exists a positive constant C such that 

E E[J S)t (X n ,X 2 )J M (X n ,X 4 )] < Cn 3 \t -s\, 

l<ij 7^ *2 — n 

l<il^i4<n 

which gives (98). 

3) Let us now consider the case where the sum is over the indices ii, «2? *3j *4 
such that the cardinal number of the set {ii, 13, £4} is equal to 4. This case 
is similar to the case 3) in the proof of Lemma 11. We need to show that 

(99) a 2 n Yl nJ(X n ,X l2 )J(X i3 ,X l4 )]<C 1 -^, 

1^*1 >*3 >*4 — n 
|{il,t2,i3,»4}|=4 

where a n = n mD l 2 ~ 2 L{n)~ m l 2 , J = J Sji is defined in (96) if m = 1 and 
in (97) if m = 2, C > and a > 0. We will only present the case m = 1. 
The idea is, once again, to replace (X^ , Xj 2 , Xj 3 , Xj 4 ) by (Xj 4 , Xj 2 , Xj 3 , Xj 4 ) 
using Lemma 14, so that (66) holds with J, J^^, Ji 3 ,i 4 replaced by J, 
Ji u i 2 , Ji 3 ,u an d to expand ,i 2 (X^ , Xj 2 ) and Ji 3j i 4 (Xj 3 ,X i4 ) in Flermite 
polynomials as in (67), with an expansion up to K, thus defining J^ i2 and 
^hu' Then, (68) holds with J replaced by J. Denoting again the coefficients 
of the expansion by Cpx&i an d c P3 ',p 4 respectively, one needs to majorize the 
right-hand side of (69). Assuming that \i\ — i 2 \ is the smallest distance 
between two different indices, namely \i\ — i 2 \ = min{|zi — i 2 \, \h — h\, \h — 
*4 1 , 1 *2 — *3 1 5 1 *2 — ^4 1 5 1 ^3 — h\}, w e obtain that the pjj's are bounded by 
4p(ii — «2)/(l ~~ p(h ~ ^) 2 )- Using that there exist positive constants C and 
e such that \p(k)\ < C(l A \k\- D+e ) =: >y(k), for all k > 1, we get, by (47), 
that the py's are bounded by 

(100) 4 7 (ii - i 2 )/(l - 7(*1 " ^) 2 ) =: 7(n " *a) • 
By Lemma 3.2 in Taqqu (1977), we obtain 

Ep [i7 pi (Xjj )iT P2 (X i2 )H P3 (X i3 )H pi (X i4 )] 

< Cj(h -i 2 ) 5 \E[H pi (X)H P2 (X)H P3 (X)H m (X)}\ . 

Using (72), EAJK^X^, X l2 )J* M {X l3 ,X u )\ is bounded by 

E ^(ii-y)^ E -^(3^1-^))^ • 

l<pl,P2<K 1<P3,P4<A 



44 C. LEVY-LEDUC, H. BOISTARD, E. MOULINES, M. TAQQU AND V. REISEN 
Using the Cauchy-Schwarz inequality, the first sum is bounded by 

Eifo.i^.Xfc) 2 ] 172 ( I>7(*1- W 

\p>i 

and similarly for the second sum. It follows from Lemma 14 that 
E f g.(4,X i2 )^.(l i3 ,4)] is bounded by 

Er^JiX^Xi.f^Er^JiX^Xi,) 2 } 1 / 2 ^-i^f 

\p>i 

Since Condition (26) holds and U, A, defined in (27) are Lipschitz functions, 

^[JiX^X^ff^Er^JiX^Xi,) 2 ) 1 / 2 < C\t - s\ . 
We deduce from the previous inequalities that 
a 2 n J2 nAX h ,X h )J(X h ,X i4 )\ 



1^*1 '*3 '*4 — n 
|{21,22,23,m}|=4 



<Cn 3 a 2 n \t-s\J2 E( 3 ^)) ? 



fc|<n \P>1 

where 7 is defined in (100). Let rj be a positive constant such that ?] > 3. 
Then there is K > 1 such that t]^{k) < 1, for all k > K. We may suppose 
without loss of generality that K = 1, that is 777 (fc) < 1, for all fc > 1. We 
then obtain that for large enough n, 
(101) 

/ \ 



a 2 n E[(R n (t) - R n (s)) 2 ] < C V 2 \t - s\ a 2 n n 3 



E 7(*) s 



\k\ <n 

Wo / 




Observe that a 2 ?i 3 = n D ~ l L{n)~ l . Recall also that j(k) is defined in (100) 
and note that ^2ikl{k) 2 ma Y be finite or infinite. If ^^^(k) 2 < 00 then 
the result (81) follows with a = 1 — D which is positive since D < 1. If 
Sfc7(^) 2 = 00 then X)|fe|<n 7(^) 2 ~ 16n 1-2D+2e and the result follows with 
a = D — 2e if e is chosen small enough to ensure that this quantity is 
positive. □ 
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PROOF of Lemma 18. We want to apply Lemma 5.2, P. 4307 of 
Borovkova, Burton and Dehling (2001) to {a n R n (r),r £ /}. To do so, we 
first prove that for all s, t € 1, 5 > such that s < t < s + 5 and s + 5 € /: 

(102) _ 

a n \R n (t)-R n (s)\<a n \R n (s+5)-R n (s)\+2a n \A(s+5)-A(s)\\ ^ X;| 

+ 2a n n(n - l)\U(s + 5)- U(s)\ , if m = 1 and D < 1 , 
where A is defined in (27) and 

(103) a n \R n (t) - R n (s)\ < a n \R n (s + 5)- R n (s)\ + 2a n \A(s + 5)- A(s)| 
[| Yl X i X i\ + \ Yl (X? -l)\}+2a n n(n-l)\U(s + 5)-U(s)\ , 

l<ij^j<n 1 < <n 

if m = 2 and D < 1/2 . 

Let us focus on the proof of (102), where m = 1 and D < 1, since the proof of 

(103) can be obtained by using similar arguments. In view of the definition 
(21) of R n and the fact that U n and U are non decreasing functions, we 
obtain 

R n (t)-R n (s) < R n {s+5)-R n {s)+W n {s+5)-W n {t)+n{n-l){U{s+5)-U{s)) . 

Remark that the monotonicity of h in (27) implies that A is a non decreasing 
function and that for p + q < 2, 

(104) |ap iff («) - Op, ff (r)| < \A(s) - A(r)| , for all r, s . 

Since p + q < 2 and m = 1, we need to consider only p = 1, q = and p = 0, 
q = 1 in (22), we thus get 

(105) Wn(s + <5) - Wn(t) < 2(A(s + 5) - A(s)) | ^ X t \ . 

In the same way, after switching s and £, we obtain 
Rn(s)-R n (t) <R n (s + 5)-R n (s) + 2(A(s + 5)-A(s))\ X i\ 

+ 2n(n-l)(Z7(a + *)-?/(«)) , 
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which gives (102). In Lemma 5.2, P. 4307 of Borovkova, Burton and Dehling 
(2001), the monotone Lipschitz-continuous function A is here 2U, the process 
{Y n (t)} is {anMtfiYsiKijtjKnXi)} if m = 1, and if m = 2, the process 
{Y n (t)} is {a n A(t)(Ei<i? j <n X i X 3^+ Ei<i^<n( X f ~ !))}■ Using Lemma 
16 and the fact that the function A defined in (27) is a Lipschitz function, 
the processes {Y n (t),t S /} defined above satisfy the condition (ii) of Lemma 
5.2 in Borovkova, Burton and Dehling (2001) with r = 2. Using Lemma 17, 
the condition (i) of Lemma 5.2 in Borovkova, Burton and Dehling (2001) is 
also satisfied. This concludes the proof. □ 

6. Numerical experiments. In this section, we investigate the robust- 
ness properties of the Hodges-Lehmann and Shamos scale estimators defined 
in Section 4 using Monte Carlo experiments. We shall regard the observa- 
tions X t , t = 1, . . . ,n, as a stationary series Y t , t = 1, . . . ,n, corrupted by 
additive outliers of magnitude u. Thus we set 

(106) X t = Y t + uW u 

where Wt are i.i.d. random variables. In Section 6.1, Wt are Bernoulli(p/2) 
random variables. In Section 6.2, Wt are such that P (Wt = — 1) = P (Wt = 1) 
= p/2 and P (W t = 0) = 1 - p, hence E[W t ] = and E[W?] = Var(W t ) = p. 
Observe that, in this case, W is the product of Bernoulli(p) and Rademacher 
independent random variables; the latter equals 1 or —1, both with prob- 
ability 1/2. (Yt)t is a stationary time series and it is assumed that Yj and 
Wt are independent random variables. The empirical study is based on 5000 
independent replications with n = 600, p = 10% and u = 10. We consider 
the cases where (Yt) are Gaussian ARFIMA(1, d, 0) processes, that is, 

(107) Y t = (I-<j>B)- 1 (I-B)- d Z t , 

where B denotes the backward operator, (j> = 0.2 and d = 0.1, 0.35, corre- 
sponding respectively to D = 0.8, 0.3, where D is defined in (Al) and (Zt) 
are i.i.d AA(0, 1). 

6.1. Hodges-Lehmann estimator. In this section, we illustrate the results 
of Proposition 5. In Figure 1, the empirical density functions of 9hl an d X n 
are displayed when Xt has no outliers with d = 0.1 (left) and d = 0.35 (right). 
In these cases both shapes are similar to the limit indicated in Proposition 
5, that is, a Gaussian density with mean zero. 
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Figure 2 displays the same quantities as in Figure 1 when Xf has outliers 
with d = 0.1 (left) and d = 0.35 (right). As expected, the sample mean is 
much more sensitive to the presence of outliers than the Hodges-Lehmann 
estimator. Observe that when the long-range dependence is strong (large d), 
the effect of outliers is less pronounced. 



0.12 , 1 1 1 1 1 1 1 1 1 0.18 




2.5 3 



Fig 2. Empirical densities of the quantities 8 h l ('*') and X n ('o') for the ARFIMA(l,d,0) 
model with d = 0.1 (left), d = 0.35 (right), n — 600 with outliers (p — 10% and to = 10). 



6.2. Shamos scale estimator. In this section, we illustrate the results of 
Proposition 8. In Figure 3, the empirical densities of &bl — o~ and o n ^x — o~ 
are displayed when d = 0.1 without outliers (left) and with outliers (right). 
In the left part of this figure, we illustrate the results of the first part of 
Proposition 8 since both shapes are similar to that of Gaussian density with 
mean zero. On the right part of Figure 3, we can see that the classical 
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scale estimator is much more sensitive to the presence of outliers than the 
Shamos-Bickel estimator. 

0.16 n, ° 
0.14 
0.12 
0.1 
0.08 
0.06 
0.04 
0.02 
Q 

-0.2 -0.15 -0.1 -0.05 0.05 0.1 0.15 0.5 1 1.5 2 2.5 3 

Fig 3. Empirical densities of the quantities (obl — cr) ('*') and (<Xn,x — cr) ('o'J for the 
ARFIMA(1, d,0) model with d — 0.1, n = 600 without outliers (left) and with outliers 
p — 10% and u) — 10 (right). 
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Fig 4. Empirical densities of the quantities (ctsb — cr) ('*') and (a n ,x — cr) C°') f or the 
ARFIMA(1, d,0) model with d = 0.35, n = 600 without outliers (left) and with outliers 
p = 10% and oj = 10 (right). 

Figure 4 (left) illustrates the second part of Proposition 8. d = 0.35 cor- 
responds to D = 0.3 < 1/2. The right part of Figure 4 shows the robustness 
of the Shamos estimator with respect to the classical scale estimator in the 
presence of outliers. 
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